FactRank: Random Walks on a Web of Facts
نویسندگان
چکیده
Fact collections are mostly built using semi-supervised relation extraction techniques and wisdom of the crowds methods, rendering them inherently noisy. In this paper, we propose to validate the resulting facts by leveraging global constraints inherent in large fact collections, observing that correct facts will tend to match their arguments with other facts more often than with incorrect ones. We model this intuition as a graph-ranking problem over a fact graph and explore novel random walk algorithms. We present an empirical study, over a large set of facts extracted from a 500 million document webcrawl, validating the model and showing that it improves fact quality over state-of-the-art methods.
منابع مشابه
A PRELUDE TO THE THEORY OF RANDOM WALKS IN RANDOM ENVIRONMENTS
A random walk on a lattice is one of the most fundamental models in probability theory. When the random walk is inhomogenous and its inhomogeniety comes from an ergodic stationary process, the walk is called a random walk in a random environment (RWRE). The basic questions such as the law of large numbers (LLN), the central limit theorem (CLT), and the large deviation principle (LDP) are ...
متن کاملConstructing a sequence of random walks strongly converging to Brownian motion
It is one of the most basic facts in probability theory that random walks, after proper rescaling, converge to Brownian motion. However, Donsker’s classical theorem [Don51] only states a convergence in law. Various results of almost sure convergence exist (see e.g. [KMT75, KMT76] and the references therein) but involves rather intricate relations between the converging sequence of random walks ...
متن کاملUpper and Lower Space-time Envelopes for Oscillating Random Walks Conditioned to Stay Positive
We provide integral tests for functions to be upper and lower space time envelopes for random walks conditioned to stay positive. As a result we deduce a `Hartman-Winter' Law of the Iterated Logarithm for random walks conditioned to stay positive under a third moment assumption. We also show that under a second moment assumption the conditioned random walk grows faster than n 1=2 (log n) ?(1+")...
متن کاملWIT: Web People Search Disambiguation using Random Walks
In this paper, we describe our work on a random walks-based approach to disambiguating people in web search results, and the implementation of a system that supports such approach, which we used to participate at Semeval’07 Web People Search task.
متن کاملContinuous-Time Quantum Walks on the Symmetric Group
In this paper we study continuous-time quantum walks on Cayley graphs of the symmetric group, and prove various facts concerning such walks that demonstrate significant differences from their classical analogues. In particular, we show that for several natural choices for generating sets, these quantum walks do not have uniform limiting distributions, and are effectively blind to large areas of...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2010